Clustering for Research and Production Scale, Parallel and Distributed Computing
نویسنده
چکیده
A lot of attention has been paid to so-called Beowulf/Avalon clusters, where PCs or Alphas are strung together with 100Mbit/s Ethernet and portable programs from supercomputers have been run on these, particularly when modest bandwidth and latency requirements are posed by the example applications. In addition, heroic efforts to scale clusters using early gigabit/s scalable fabrics has been done across the world, but these systems, like the Beowulf counterparts, have relied on software from the previous generation of multicomputers and supercomputing systems. However, commercial-grade software tools (middleware and distributed environments) for clusters have matured considerably since the initial Beowulf type experiments, as have the availability of easy-to-use cluster interconnects. In this talk we review the technical achievements thus far in production-grade environments for both message passing and cluster scheduling, both for NT and Linux. This talk emphasizes the option of having tools and hardware that is scalable, to varying degrees, and presents a taxonomy of hardware, software, and applications that divides the space of activities and also seeks to establish areas where additional opportunities for new software and other tools exist. Issues of security and scalability are considered as are cost of ownership vs. freeware, with proposed economic models for both company and university adopters of clusters. We discuss pros and cons of open source vs. commercial products as viable options moving forward.
منابع مشابه
Entropy-based Consensus for Distributed Data Clustering
The increasingly larger scale of available data and the more restrictive concerns on their privacy are some of the challenging aspects of data mining today. In this paper, Entropy-based Consensus on Cluster Centers (EC3) is introduced for clustering in distributed systems with a consideration for confidentiality of data; i.e. it is the negotiations among local cluster centers that are used in t...
متن کاملGreen Energy-aware task scheduling using the DVFS technique in Cloud Computing
Nowdays, energy consumption as a critical issue in distributed computing systems with high performance has become so green computing tries to energy consumption, carbon footprint and CO2 emissions in high performance computing systems (HPCs) such as clusters, Grid and Cloud that a large number of parallel. Reducing energy consumption for high end computing can bring various benefits such as red...
متن کاملStatic Task Allocation in Distributed Systems Using Parallel Genetic Algorithm
Over the past two decades, PC speeds have increased from a few instructions per second to several million instructions per second. The tremendous speed of today's networks as well as the increasing need for high-performance systems has made researchers interested in parallel and distributed computing. The rapid growth of distributed systems has led to a variety of problems. Task allocation is a...
متن کاملTowards Parallel and Distributed Computing in Large-Scale Data Mining: A Survey
The implementation of data mining ideas in high-performance parallel and distributed computing environments is becoming crucial for ensuring system scalability and interactivity as data continues to grow inexorably in size and complexity. This paper is a survey on the parallelization of well-known data mining techniques covering classification, link analysis, clustering and sequential learning,...
متن کاملParallel computing using MPI and OpenMP on self-configured platform, UMZHPC.
Parallel computing is a topic of interest for a broad scientific community since it facilitates many time-consuming algorithms in different application domains.In this paper, we introduce a novel platform for parallel computing by using MPI and OpenMP programming languages based on set of networked PCs. UMZHPC is a free Linux-based parallel computing infrastructure that has been developed to cr...
متن کاملA Clustering Approach to Scientific Workflow Scheduling on the Cloud with Deadline and Cost Constraints
One of the main features of High Throughput Computing systems is the availability of high power processing resources. Cloud Computing systems can offer these features through concepts like Pay-Per-Use and Quality of Service (QoS) over the Internet. Many applications in Cloud computing are represented by workflows. Quality of Service is one of the most important challenges in the context of sche...
متن کامل